Distributionally Robust Policy Gradient for Offline Contextual Bandits

Published in AISTATS, 2023

Recommended citation: Zhouhao Yang, et al. "Distributionally robust policy gradient for offline contextual bandits." International Conference on Artificial Intelligence and Statistics. PMLR, 2023. https://proceedings.mlr.press/v206/yang23f.html